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Abstract:  Cell-free  circulating  microRNAs  (miRNAs)  in  the  blood  are  good  diagnostic 
biomarker  candidates  for  various  physiopathological  conditions,  including  cancer, 
neurodegeneration,  diabetes  and  other  diseases.  Since  their  discovery  in  2008  as  blood 
biomarkers,  the  field  has  expanded  rapidly  with  a  number  of  important  findings.  Despite 
the  initial  optimistic  views  of  their  potential  for  clinical  application,  there  are  currently  no 
circulating  miRNA-based  diagnostics  in  use.  In  this  article,  we  review  the  status  of 
circulating  miRNAs,  examine  different  analytical  approaches,  and  address  some  of  the 
challenges  and  opportunities. 

Keywords:  diagnostic;  exosomes;  microvesicles;  quantitative  methods 


1,  Introduction 

MicroRNAs  (miRNAs)  were  originally  identified  as  small,  non-coding  RNA  mediators  of  temporal 
pattern  formation  in  C.  elegans  [1-5],  which  regulated  messenger  RNAs  (mRNAs)  by  base  pairing 
with  their  3 '-untranslated  regions  (3'-UTR).  Later  research  showed  that  miRNAs  (and  this  type  of 
RNA  mediated  regulatory  processes)  are  conserved  in  metazoan  species  [6-9]  and  have  also  been 
found  in  viruses  [10].  MiRNAs  are  currently  differentiated  from  other  small  RNAs  by:  (1)  a  hairpin 
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fold-back  precursor  structure  in  their  primary  transeript,  as  predicted  by  base  composition  (2) 
and  an  approximately  18-22  nueleotide  long  mature  sequence  [11].  At  present,  1,872  precursor 
sequences  and  2,578  mature  human  sequenees  have  been  catalogued  by  the  online  repository 
mlRBase  (v20)  [12], 

Primary  miRNA  transeripts  (pri-miRNAs)  are  transeribed  in  the  nucleus  and  processed  by  an  RNase 
III  enzyme,  Drosha  [13].  The  resulting  smaller  hairpin  pre-miRNA  is  transported  to  the  cytoplasm  via  a 
RanGTP-dependent  double  stranded  RNA  binding  protein,  exportin  5.  The  pre-miRNA  is  then  further 
proeessed  by  another  RNase  III,  Dicer  [14],  into  an  approximately  22  nucleotide  long  duplex  RNA 
with  3’  overhangs.  The  duplex  is  subsequently  unwound  by  RNA  helicase  and  usually  one  strand  (the 
guide  strand)  is  preferentially  assoeiated  with  the  effector-protein  Argonaute  [15,16],  as  part  of  the 
RNA  Indueed  Silencing  Complex  (RISC).  The  other  strand  (the  passenger  strand)  is  degraded.  The 
proeess  for  strand  selection  is  yet  to  be  fully  understood.  The  miRNA-RISC  eomplex  can  then  internet 
with  mRNA  targets  to  inhibit  translation  or  induce  degradation  of  the  mRNA,  typieally  through 
binding  at  the  3 '-untranslated  region  (3'-UTR)  with  partial  sequenee  complementarity.  The  RNase  III 
eleavages  are  often  impreeise,  resulting  in  end  region  sequence  variations  (espeeially  the  3'  end)  of 
mature  miRNAs.  In  addition  to  this,  miRNAs  ean  be  post-transcriptionally  modified  (e.g.,  via 
non-templated  additions),  resulting  in  sequence  variants  (isomiRs)  [17-20],  whieh  may  have 
functionally  distinct  properties  [21-23],  expanding  the  miRNA  repertoire. 

Through  interaeting  with  mRNA  targets,  miRNAs  predominantly  aet  to  modulate  the  transeriptome 
of  cells  [24]  and  have  been  implieated  in  various  physiopathological  proeesses,  ineluding  development, 
homeostasis  and  cellular  pathology  [25,26].  Based  on  various  prediction  algorithms,  the  majority  of 
mRNA  transcripts  are  potential  targets  for  miRNAs  [27].  MieroRNA-target  binding  is  usually 
mediated  through  the  initial  interaction  of  the  seed  region  (nueleotides  2-8  of  the  5'  end  of  the 
miRNA)  [13],  but  a  significant  fraction  (approximately  60%  based  on  one  report  [28])  of  seed 
interactions  are  non-canonical  {i.e.,  contain  bulging  or  mismatches)  and  are  generally  associated  with 
additional,  non-seed  base  pairing.  These  flexible  constraints  on  specificity  permit  a  single  miRNA  to 
target  multiple  (possibly  hundreds)  of  mRNAs  and  elieit  effeets  on  multiple  biological  processes.  A 
single  mRNA  can  also  contain  binding  sites  for  multiple  different  miRNA  sequenees.  This  makes 
miRNA-mediated  transeriptome/proteome  regulation  a  complicated  proeess  and  difficult  to  decipher. 

The  expression  of  certain  miRNAs  is  restricted  to  speeifie  cell  types  [29]  or  developmental  stages; 
therefore,  they  are  frequently  dysregulated  in  pathologies  including  cancer  [30-33],  cardiovascular 
diseases  [34-41]  and  neurological  conditions  [42-46].  In  the  case  of  caneer,  analyzing  the  miRNA 
spectrum  of  tumors  yields  qualitative  diagnostic  information  and  allows  possible  identification  of  the 
tissue  of  origin  and  in  some  eases,  provides  molecular  stratification  of  tumor  subtypes  [47-49]. 

2,  Early  Studies  of  Circulating  miRNA 

The  presence  of  cell-free  nueleic  aeids  in  blood  cireulation  has  been  established  for  almost  60 
years  [50-54].  Tumor  derived  DNA  and  RNA  are  commonly  observed  in  eancer  patient  plasma 
samples  [55-58].  RNA  was  historieally  thought  to  be  unsuitable  as  a  practical  blood-based  biomarker, 
due  to  the  high  level  of  nuclease  aetivity  in  human  plasma  [59],  but  this  skeptical  position  was 
reconsidered  after  the  observation  that  miRNAs  are  stable  in  fixed  tissues  [60]  and  that  the 
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concentration  of  miR-21  in  serum  eould  distinguish  patients  with  B-cell  lymphoma  from  eontrol 
individuals  [61].  Subsequent  work  detected  speeifie  eirculating  miRNA  signatures  in  healthy 
individuals  and  patients  with  prostate,  breast  and  coloreetal  caneers  [62-67],  in  addition  to  other 
disorders.  Detailed  examination  [62]  revealed  that  the  deteeted  speeies  were  bona  fide  eell-free 
miRNAs  and  explicitly  demonstrated  their  stability  outside  of  the  eells  (tolerating  prolonged 
ineubation  at  room  temperature  in  addition  to  multiple  cyeles  of  freezing  and  thawing  of  the  plasma 
samples).  The  release  of  miRNA  from  tumors  was  further  demonstrated  by  detecting  human  miRNA 
sequences  in  plasma  drawn  from  mice  harboring  human  xenografts.  The  field  expanded  rapidly, 
describing  numerous  eirculating  miRNA  biomarkers  for  eaneer  (reviewed  elsewhere  in  this  issue  [68], 
and  other  eonditions  ineluding  neurodegeneration,  cardiovascular  disease  and  metabolie  disease  (Table  1). 
Although  it  has  been  demonstrated  that  miRNAs  ean  be  released  into  eireulation  from  eells  speeifie  to 
the  disease  (e.g.,  miR-122  from  liver  toxicity  studies  [69,70])  the  eellular  origin  of  many  miRNA  in 
biofiuids  is  unclear.  For  example,  blood  levels  of  miRNAs  highly  expressed  in  heart  tissues,  sueh  as 
miR-208a  and  miR-499,  were  ineluded  as  a  part  of  signatures  to  differentiate  patients  with 
eardiovascular  disease  in  several  studies  [71-74].  This  suggests  that  inereased  levels  of  heart-speeific 
blood  miRNAs  may  derive  from  the  heart  and  direetly  reflect  the  heart  eondition  of  patients.  However 
whether  all  reported  miRNAs  are  actually  directly  associated  with  the  disease  or  are  the  product  of 
nonspecific,  systemic  or  secondary  responses  remains  to  be  further  evaluated. 

Table  1,  Circulating  miRNA  biomarkers  for  neurodegeneration,  cardiovascular  and 
metabolic  diseases.  Red  text:  increased  abundanee  relative  to  eontrols,  green  text:  reduced 
abundance  relative  to  eontrols.  (AD)  Alzheimer  Disease,  (CAD)  Coronary  Artery  Disease, 
(GDM)  Gestational  Diabetes  Mellitus,  (HD)  Huntington  Disease,  (IFG)  impaired  fasting 
glueose,  (IGT)  impaired  glueose  tolerance,  (MCI)  Mild  Cognitive  Impairment,  (NEC) 
Normal  Elderly  Controls,  (NGS)  Next  Generation  Sequencing,  (PBMC)  Peripheral  Blood 
Mononuclear  Cells,  (PD)  Parkinson  Disease,  (TID)  Type  1  Diabetes,  (T2D)  Type  2 
Diabetes,  (RT-qPCR)  Reverse  Transeription  Quantitative  PCR. 


Disease 

miRNA 

Detection 

Method 

Specimen 

Population 

Ref. 

Neurodegenerative  Disease 

Alzheimer  Disease 

miR-34a,  -181b,  -200a,  let-7f 

Microarray, 

RT-qPCR 

PBMC 

1 6  AD  patients  and  1 6  NEC 

matched  for  ethnicity,  age, 

gender  and  education 

[75] 

60  miRNAs  (37  higher 

abundance,  23  lower 

abundance) 

RT-qPCR 

CSF 

10  AD,  10  controls 

[76] 

miRNA-9,  -125b,  -146a,  -155 

Microarray, 

Northern  dot  blot 

CSF 

6  AD,  6  controls 

[77] 

miR-137,  -181c,  -9,  -29a,  -29b 

RT-qPCR 

Serum 

7  AD,  7  controls 

[78] 

miR-29b 

RT-qPCR 

PBMC 

393  AD,  412  controls 

[79] 
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Disease 

miRNA 

Detection 

Method 

Specimen 

Population 

Ref. 

Neurodegenerative  Disease 

Alzheimer  Disease 

miR-15a 

RT-qPCR 

Plasma,  CSF 

1 1  AD,  9  MCI,  20  NC; 

20  AD,  17NC 

[80] 

let-7d-5p,  let-7g-5p,  miR-15b- 
5p,  -142-3p,  -191-5p,  -301a-3p, 
-545-3p 

nCounter,  RT-qPCR 

Plasma 

Screening:  1 1  AD,  20  NC; 
validation  20  AD,  17  NC 

[81] 

miR-34c 

RT-qPCR 

Plasma 

110  AD,  123  NEC 

[82] 

miR-146a 

RT-qPCR 

CSF 

10  AD,  10  early  AD, 

1 1  controls 

[83] 

Parkinson  Disease 

miR-1,  -22*,  -29a,  -16-2* 

-26a2*,  -30a 

RT-qPCR 

Whole  blood 

15  PD  (8  untreated  PD), 

8  controls 

[84] 

18  miRNAs 

Microarray 

PBMC 

19  PD,  13  controls 

[85] 

miR-222,  -626,  -505 

Microarray, 

RT-qPCR 

Plasma 

32  PD,  34  controls;  42  PD, 

30  controls;  30  PD, 

8  controls 

[86] 

miR-331-5p 

RT-qPCR 

Plasma 

3 1  PD,  25  controls 

[87] 

16  miRNAs  (10  higher 

ahundanee,  6  iower 

ahundanee) 

NGS 

Blood 

leukocyte 

7  PD,  6  controls 

[88] 

Huntington 

Disease 

miR-34h 

Microarray,  RT-qPCR 

Plasma 

27  HD,  12  controls 

[89] 

Cardiovascuiar  Disease 

Coronary  Artery  Disease 

miR-126,  -17,  -92a,  -155,  -145; 

-133a,  -208a 

RT-qPCR 

Plasma, 

serum 

8  CAD,  8  controls; 

36  CAD,  17  controls 

[71] 

miR-135a  and  miR-147 

RT-qPCR 

PBMC 

50  CAD,  20  controls 

[90] 

miR-146a  and  miR-146h 

RT-qPCR 

PBMC 

41  CAD,  15  controls 

[91] 

miR-19a,  -584,  -155,  -222,  -145, 

-29a,  -378,  -342,  -181d,  -30e-5p, 

-150) 

Microarray 

RT-qPCR 

Whole  blood 

5  CAD,  5  controls  for 
initial;  10  CAD,  15 

controls  for  validation 

[92] 

miR-214 

RT-qPCR 

Plasma 

40  CAD,  15  controls 

[93] 

Acute  Myocardial 

Infarction 

miR-1,  -133a,  -499,  -208a 

Microarray,  RT-qPCR 

Plasma 

33  AMI,  30  controls 

[72] 

miR-1 

RT-qPCR 

Plasma 

93  AMI,  66  controls 

[94] 

miR-1 

RT-qPCR 

Serum 

3 1  AMI,  20  controls 

[95] 

miR-499 

Microarray, 

RT-qPCR 

Plasma 

14  AMI,  15  heart  failure 
patients,  10  controls 

[73] 

miR-126,  -197,  -223 

Microarray,  RT-qPCR 

Plasma 

820  Bruneck  cohort  [96] 

[97] 
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Disease 

miRNA 

Detection 

Method 

Specimen 

Population 

Ref. 

Cardiovascular  Disease 

Acute  Myocardial 

Infarction 

miR-1915,  -181 

RT-qPCR 

Whole  blood 

60  AMI,  21  controls, 

5  time  points  (0-24  h) 

[98] 

miR-133a 

RT-qPCR 

Plasma 

13  AMI  patients, 

176  angina  pectoris 
patients,  127  controls 

[99] 

miR-1,  -134,  -186,  -208,  -223 

and  -499 

NGS, 

RT-qPCR 

Serum 

1 17  AMI  patients,  182  AP 
patients,  100  controls 

[74] 

Congestive  Heart 

Failure 

miR-210 

RT-qPCR 

PBMC 

13  patients,  6  controls 

[100] 

miR-126 

RT-qPCR 

Plasma 

33  patients,  17  controls 

[101] 

Aortic 

Aneurysm 

miR-29b,  -124,  -155,  -223 

RT-qPCR 

Plasma 

23  patients,  12  healthy 
controls,  17  coronary 
artery  disease  patients 

[102] 

Stroke 

miR-125b-2*,  -27a,  -422a,  - 

488,  -627 

Microarray,  RT-qPCR 

Plasma 

169  stroke  patients, 

94  metabolic  syndrome 
patients,  24  healthy  controls 

[103] 

miR-145 

RT-qPCR 

Whole  Blood 

32  ischemic  stroke  patients, 

14  healthy  controls 

[104] 

Atherosclerosis 

miR-130a,  -27b,  -210 

RT-qPCR 

Serum 

104  patients,  105  controls 

[105] 

Metabolic  Disease 

Type  1 

Diabetes 

miR-152,  -30a-5p,  -181a,  -24, 

-148a,  -210,  -27a,  -29a,  -26a, 

-27b,  -25,  -200a 

NGS, 

RT-qPCR 

Serum 

pooled  from  2  TID  groups 
(275,  129)  and  one  control 
group  {n=  151) 

[106] 

Type  2  Diabetes 

miR-20b,  -21,  -24,  -15a,  -126, 

-191,  -197,  -223,  -320,  -486, 

-150,  -29b,  -28-3p 

Microarray,  RT-qPCR 

Plasma 

80  patients,  80  controls 

[107] 

miR-9,  -29a,  -30d,  -34a,  - 

124a, 

-146a  and  -375 

RT-qPCR 

Serum 

18  T2D,  19  pre-diabetes 

(IGT  and/or  IFG), 

19  controls 

[108] 
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Disease 

miRNA 

Detection 

Method 

Specimen 

Population 

Ref. 

miR-146a 

RT-qPCR 

Plasma 

90  patients, 

90  controls 

[109] 

CM 

CM 

OJ 

cd 

s 

miR-29a 

RT-qPCR 

Urine 

83  patients  (42  with 

albuminuria,  41  with 

[110] 

(N 

a> 

Oh 

normoalbuminuria) 

5 

"o 

A 

CQ 

H 

miR-24  and  miR-29b,  (miR-144  in 

Swedish  T2D) 

RT-qPCR 

Plasma 

33  patients  (14 

Swedes,  19  Iraqis), 

1 19  controls 

[111] 

Gestational 

Diabetes 

Mellitus 

miR-132,  -29a  and  miR-222 

RT-qPCR 

Serum 

24  GDM,  24 

controls 

[112] 

3,  Physical  State  and  Biological  Function 

The  stability  of  cell-free  circulating  miRNA  in  the  extracellular  environment  (e.g.,  plasma)  had 
been  established,  but  the  mechanistic  basis  of  this  resistance  to  degradation  remained  unclear. 
Synthetic  miRNA  oligonucleotides  and  miRNA  purified  from  plasma  samples  were  rapidly  degraded 
when  combined  with  non-denatured  plasma  [62],  indicating  that  miRNAs  were  not  stabilized  by 
intrinsic  factors  such  as  secondary  structure  or  chemical  modification.  This  observation  led  to  the 
hypothesis  in  the  field  that  cell-free  circulating  miRNAs  were  packaged  in  a  way  to  protect  them  from 
RNase  degradation,  possibly  in  lipid  vesicles  or  as  part  of  a  protein  complex. 

3.1.  Exosomes  and  Other  Vesicles 

Among  extracellular  lipid  vesicles,  exosomes  have  received  considerable  attention  [113-116]. 
Exosomes  are  50-100  nm  diameter  membrane  vesicles,  secreted  by  diverse  cell  types  in  vivo  and 
in  vitro.  These  vesicles  are  present  in  many  bodily  fluids  (e.g.,  blood,  saliva,  breast  milk,  seminal  fluid, 
ascites,  urine,  etc.)  [115,117],  and  have  been  shown  to  contain  characteristic  protein,  mRNA  and 
miRNA  molecules  derived  from  their  cell  of  origin.  In  addition  to  exosomes,  cells  also  produce  larger 
lipid  vesicles,  microvesicles  (1-10  pm  in  diameter).  Like  exosomes,  microvesicles  also  contain 
protein,  miRNA  and  mRNA.  Recently,  prostate  cancer  cells  have  been  observed  to  produce 
microvesicles,  which  have  been  termed  large  oncosomes  (LOs)  [118].  LOs  contain  miRNAs  associated 
with  cell  migration  and  invasion.  Apoptotic  bodies  derived  from  normal  cell  turn  over  or  from  specific 
pathologies  (e.g.,  cancer)  may  also  contribute  to  RNA  in  circulation. 

Circulating  miRNAs  may  also  be  derived  from  platelets,  which  are  abundant  in  typical  plasma 
preparations,  contain  a  large  quantity  of  miRNA  [119,120]  and  have  been  reported  to  contain  tumor- 
derived  mRNA  when  isolated  from  the  plasma  of  cancer  patients  [121].  The  authors  of  this  study 
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suggest  that  platelets  aequire  tumor-derived  mRNA  through  exosome-mediated  transfer,  although  it  is 
possible  that  platelet  preparations  may  be  eontaminated  with  eireulating  tumor  eell  debris  [122], 

Several  studies  have  observed  the  uptake  of  exosomes  and  other  mlRNA-eontaining  vesieles 
into  eells  in  vitro  [116,118,123-125],  leading  to  the  hypothesis  of  mlRNA-based  intereellular 
oommunieation  mediated  by  exosomes  (like  a  “message  in  a  bottle”  [126])  but  it  remains  unelear  if 
this  is  a  general  phenomenon. 

3.2.  Protein-miRNA  Complexes 

Multiple  studies  have  also  reported  eell-free  eireulating  miRNAs  in  non-lipid  vesiele 
forms  [127,128].  Using  differential  eentrifugation,  size-exelusion  ehromatography,  filtration  and 
immunopreeipitation  approaehes,  investigators  observed  that  a  large  fraetion  of  miRNAs  in  platelet- 
poor  plasma/serum  from  healthy  human  donors  and  eonditioned  eell  medium  were  not  assoeiated  with 
lipid  vesieles  and  were  sensitive  to  protease  digestion.  This  observation  suggested  that  miRNA  was 
proteeted  from  plasma  RNase  aetivity  by  assoeiation  with  protein.  A  substantial  amount  of  this  soluble 
miRNA  was  assoeiated  with  the  endogenous  effeetor  protein  Argonaute-2  (Ago2).  Independent  studies 
have  also  observed  miRNA  assoeiated  with  other  proteins  sueh  as  high-density  lipoprotein  [129]  and 
nueleophosmin  [130]. 

2.2.  Exogenous  miRNAs  in  Circulation 

Reeent  reports  deseribing  exogenous  miRNA  and  other  RNA  speeies  in  the  blood  of  healthy 
individuals  [131,132]  have  raised  some  interesting  questions  about  the  genesis,  funetion,  and 
eomplexity  of  eell-free  RNA  in  eireulation.  To  aeeurately  determine  the  origin  of  these  RNAs  derived 
from  exogenous  speeies  is  a  ehallenge,  due  to  the  laek  of  genomie  sequenee  information  for  many 
different  speeies.  Nevertheless,  the  preliminary  results  indieate  these  RNA  moleeules  are  probably 
derived  from  numerous  baeteria,  fungi  and  food,  whieh  suggest  that  gut  is  the  primary  souree  of  these 
sequenees.  Funetional  delivery  of  small  RNAs  through  ingestion  has  been  very  well  established  in 
nematodes  [133].  However,  tissue  barrier  systems  in  mammals  are  mueh  more  elaborate  and  it  is 
unelear  how  RNA  might  penetrate  them.  Limited  experimental  data  also  showed  that  some  of  these 
exogenous  RNAs  are  paekaged  in  lipid  vesieles  and  ean  affeet  the  eellular  transeriptome  upon  entry 
into  the  eells.  How  these  RNAs  permeated  the  gut  lining,  how  they  are  paekaged,  how  they  are 
seleeted  and  how  did  eells  reeognize  and  take  up  these  exogenous  RNA  speeies  are  just  some  of  the 
interesting  issues  are  waiting  to  be  answered.  The  finding  of  exogenous  miRNAs  in  normal  human 
blood  also  raises  some  interesting  eoneerns.  Sinee  some  miRNA  sequenees  are  extremely  eonserved 
throughout  evolution,  the  levels  of  those  miRNAs  in  eireulation  may  not  be  eompletely  eontributed 
from  human  eells.  In  addition,  pathogenie  or  adverse  effeets  indueed  by  exogenous  RNAs  are 
a  possibility. 

4.  Status  of  Circulating  miRNA  Biomarkers 

Sinee  the  establishment  of  eireulating  miRNAs  as  biomarkers  for  eaneer  in  2008  [61,62],  there  have 
been  over  500  reports  deseribing  their  biology  and  diagnostie  utility  for  various  eonditions.  Relative  to 
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other  types  of  diagnostic  biomolecules  in  circulation  (e.g.,  proteins),  miRNA  has  many  advantages 
including  tissue  or  cell  type/stage  specific  expression,  mature  nucleic  acid  detection  technologies, 
lower  cost  and  shorter  time  required  to  develop  assays  (compared  to  the  development  of  proteomic 
tools  such  as  antibodies)  and  amplifiable  signal.  Despite  these  advantages,  the  rapid  expansion  of  the 
literature,  and  significant  commercial  interest,  circulating  miRNA  biomarkers  face  an  unclear  path 
toward  clinical  application.  For  example,  an  initial  study  of  a  circulating  miRNA  panel  on  120  patients 
provided  91  percent  sensitivity  and  72  percent  specificity  to  diagnose  colon  cancer.  However,  later 
studies  yielded  inconsistent  results  [134].  The  inconsistencies  were  attributed  to  variation  in  specimen 
collection  and  handling,  which  are  well  known  to  alter  miRNA  spectrum  [120].  Additional  factors 
might  also  have  been  involved. 

5,  Considerations  for  Circulating  MiRNA  Studies 

5.1.  Prerequisites 

Ideal  prerequisites  for  biomarker  discovery/development  processes  should  include:  (1)  the  need  for 
such  biomarker,  (2)  the  intended  use  (e.g.,  type  of  tumor  V5.  tumor  progression  or  response  to 
treatment),  (3)  well  characterized  patient  and  matched  control  specimens  in  acceptable  quantity,  (4) 
samples  have  been  banked  or  can  be  prospectively  collected,  processed  and  stored  in  a  consistent 
manner  and  (5)  a  specific,  reliable  and  easy  to  implement  detection  method.  This  process  has  been 
done  for  other  clinical  tests  such  as  pathogen  screening  and  protein-based  biomarker  measurement,  but 
there  is  no  commonly  accepted  standard  for  miRNA.  In  part  this  is  due  to  the  lack  of  understanding  of 
both  intrinsic  and  extrinsic  factors  that  may  affect  the  miRNA  spectrum.  Therefore,  fundamental  and 
comprehensive  studies  are  urgently  needed  to  address  these  issues.  This  need  is  well  recognized  in  the 
community.  For  example,  the  National  Institutes  of  Health  recently  established  the  Extracellular  RNA 
Communication  (ERC)  program  [135]  to  explore  various  properties  and  applications  for  extracellular 
miRNA  in  circulation. 

5.2.  Sample  type,  Collection  and  Processing 

Donor-related  factors  such  as  the  feeding  state  (e.g.,  fasting  V5.  non-fasting,  food  lipid  content,  etc.), 
time  of  blood  draw,  gender,  and  female  hormone  cycle  may  all  affect  the  spectrum  of  circulating 
miRNA.  However,  the  impact  of  these  factors  is  largely  unknown  at  this  point.  The  type  of  blood  product 
analyzed  (serum  V5.  plasma)  has  also  been  shown  to  affect  the  spectmm  of  circulating  miRNA  [136]. 
Several  known  issues  associated  with  the  practice  of  blood  collection  and  serum/plasma  preparation 
are  listed  below: 

•  The  use  of  small  diameter  needles  (23  gauge  or  above)  should  be  avoided  due  to  shearing 
induced  hemolysis  of  red  blood  cells  (RBCs)  [137],  which  contain  abundant  miRNAs  and  can 
alter  profiling  results  [119,138,139]. 

•  Eot-to-lot  variation  in  the  production  of  blood  collection  tubes  may  influence  results  [140];  if 
possible,  purchase  draw  tubes  from  the  same  lot.  Discard  expired  tubes,  as  these  may  have  lost 
vacuum  and  cause  variation  on  the  final  concentration  of  anticoagulant  in  the  blood. 
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•  Skin  contains  abundant  epithelial  miRNAs;  thus  a  precursor  blood  draw  or  a  blood  tube  that  is 
drawn  and  discarded  [141]  using  the  same  needle  and  tubing  can  prevent  skin  eells  from 
eontaminating  a  blood  sample  (espeeially  for  studies  of  epithelial  eaneers). 

•  Anticoagulant  choice  is  also  eritieal;  this  should  be  based  on  the  requirements  for  analysis  and 
uniform  aeross  the  study  (using  only  one  type  of  antieoagulant  tube  with  the  part  number  and 
vendor  speeified  in  the  protocol).  Heparin  is  a  well-established  inhibitor  of  PCR  and  should  be 
avoided  [142];  EDTA  tubes  have  been  widely  used  instead.  In  the  event  heparin-eontaining 
samples  are  required,  heparinase  treatment  prior  to  analysis  has  been  shown  to  inerease  miRNA 
detection  [143]  but  is  likely  to  introduee  additional  variability. 

•  Antieoagulants  and  blood  samples  should  be  gently  mixed  in  the  tube,  as  shaking  ean  eause 
hemolysis  [144].  Serum  eoagulation  eonditions  (time,  temperature)  and  use  of  serum-separator 
polymers  should  be  standardized  between  eases  and  eontrols. 

•  Blood  should  be  expediently  proeessed  and  the  time  allowed  between  draw  and  proeessing 
should  be  stated  in  the  protoeol.  Speeimens  exeeeding  this  time  limitation  should  be  flagged  and 
noted  in  the  data  analysis. 

•  Insulated  eontainers  with  uniform  temperature  should  be  used  in  paeking  speeimens  for  transit.  If 
environmental  eonditions  inelude  possible  extreme  heat  or  eold  exposure,  deviees  that  ean 
indieate  if  the  speeimen  has  exeeeded  a  threshold  temperature  should  be  used  [140]. 

•  Standardized  eentrifugation  eonditions  used  to  prepare  eell-free  blood  fraetions  (time,  temperature, 
g-foree,  rotor  type,  aeeeleration/deeeleration  eonditions)  are  also  important  as  residual  platelets, 
cell  debris,  etc.  can  alter  miRNA  abundanee  [120].  Blood  fractions  require  expedient  separation 
from  eell  pellets  to  prevent  eontamination  with  eellular  debris  and  eontents  [138,145]. 

•  As  blood  eells  contain  higher  eoneentrations  of  miRNAs  than  plasma,  care  should  be  taken  in 
aspirating  plasma  and  serum  to  prevent  eellular  earryover  [146]. 

•  Centrifugation  to  remove  debris  or  preeipitates  from  body  fluid  samples  prior  to  RNA  extraction 
needs  to  be  standardized,  as  it  will  alter  miRNA  profiles. 

•  To  avoid  unneeessary  freeze/thaw  eyeles,  if  possible  samples  should  be  stored  as  single  use 
aliquots,  with  volumes  eorresponding  to  those  intended  for  analysis. 

•  Freezing  and  storage  eonditions  should  be  standardized  (snap  or  slow  freezing).  In  addition, 
ease/eontrol  speeimens  should  be  matched  as  elosely  as  possible  for  storage  time  in  the  freezer. 

5.3.  RNA  Isolation 

The  method  used  to  purify  RNA  impaets  yield  and  the  speetrum  of  RNA  isolated  [142,147]. 
Methods  based  on  phenol-chloroform  extraction  and  silica-membrane  based  eolumns  have  been 
observed  to  enhanee  deteetion  of  eireulating  miRNAs  relative  to  methods  that  omit  either  of  these 
steps.  Omission  of  chloroform  extraction  may  be  espeeially  problematie  for  blood  samples  eontaining 
high  lipid  eontent  (e.g.,  blood  drawn  after  high-fat  meals,  etc.).  miRNA-speeifie  methods  are 
important,  as  buffers  for  miRNA  extraetion  often  utilize  higher  ethanol  eoneentrations  than  their 
mRNA  eounterparts.  Whether  or  not  to  enrich  small  RNA  fraction  is  yet  to  be  determined,  sinee  there 
are  reports  indieating  the  alteration  of  small  RNA  profile  after  enriehment.  Addition  of  earrier  RNA 
(e.g.,  MS2  baeteriophage  RNA)  [148]  has  been  observed  to  improve  miRNA  quantifieation,  but  more 


Molecules  2014, 19 


6089 


data  is  needed  to  demonstrate  its  reliability.  In  addition,  carrier  RNA  may  also  affect  downstream 
profiling  methods  (e.g.,  next-generation  sequencing).  The  use  of  low-adhesion  plasticware  is  critical, 
as  standard  plastics  can  adsorb  miRNAs  and  skew  quantification. 

MiRNA  derived  from  exosomes  and  other  vesicles  has  been  a  significant  focus  of  study,  however; 
the  purification  method  used  to  isolate  vesicles  will  impact  the  type  of  vesicles  that  are  collected  and 
therefore  their  miRNA  content.  Isolation  of  lipid  vesicles  is  generally  performed  via  differential 
centrifugation  [149-151].  The  centrifugation  force,  type  of  rotor  (e.g.,  fixed  angle  V5.  swinging  bucket) 
and  viscosity  of  the  sample  are  important  variables.  Exoquick  (EXOQ5A-1,  System  Biosciences, 
Mountain  View,  CA),  a  polymer  solvent,  facilitates  precipitation  of  exosomes  [152,153]  but  may  also 
precipitate  non-exosomal  vesicles  and  protein-miRNA  complexes  [154]. 

5.4.  Data  Correction,  Normalization,  Standards  and  Avoiding  Contamination 

Controls  for  variation  in  sample  quality,  processing  and  analysis  can  be  introduced  throughout  the 
workflow.  Hemolysis  can  be  estimated  using  colorimetry  (less  sensitive)  of  the  blood  product 
specimens  or  by  analysis  of  miRNAs  abundant  in  RBCs  (e.g.,  miR-451)  [119,155].  Normalization  to 
correct  for  variability  during  sample  purification  is  typically  achieved  through  the  use  of  synthetic 
oligoribonucleotides  (with  low  homology  to  those  expressed  in  the  species  of  interest)  spiked-in  to  the 
samples  after  thorough  denaturation  [146].  Additional  spike-in  standards  (e.g.,  UniSp6  and  UniSp3, 
Exiqon)  can  be  introduced  prior  to  cDNA  synthesis  and  PCR  to  estimate  efficiency  during  these  steps 
and  normalize  results  for  comparison. 

Standard  curves  are  typically  prepared  by  serial  dilution  of  synthetic  oligoribonucleotide  standards. 
To  ensure  reproducible  quantification,  these  standards  should  be  prepared  as  single  use  aliquots  in  low- 
adhesion  plastics.  The  choice  of  diluent  in  which  to  prepare  the  standard  curve  is  important,  as  the 
solution  matrix  alters  the  performance  of  the  assay  significantly  [156]  through  effects  on  analyte 
recovery  from  liquid  transfers,  enzyme  and  primer  efficiencies,  etc.  The  most  appropriate  choice  is  a 
diluent  that  approximates  the  complexity  of  the  samples  (e.g.,  total  RNA  prepared  from  plasma)  or  the 
use  of  an  abundant  carrier  RNA  to  dominate  matrix  effects  (e.g.,  MS/2  bacteriophage  RNA)  [148].  It  is 
possible  that  assays  may  cross-react  with  sequences  present  in  the  carrier  RNA;  thus,  the  tolerance  of 
the  assay  for  the  carrier  should  be  empirically  determined. 

As  the  concentration  of  standards  and  analytes  span  several  orders  of  magnitude,  care  must  be  taken 
to  avoid  cross  contamination,  especially  with  concentrated  stocks  of  synthetic  standards,  target-rich 
biologic  samples  and  post-PCR  products.  Contamination  events  have  proven  a  serious  distraction  for 
blood-based  molecular  diagnostics  [157-159].  Analytic  layouts  should  include  multiple  no-template 
controls  distributed  over  the  analysis  plate  as  sentinel  wells,  to  detect  possible  aerosol  contamination  of 
wells  during  plate  manipulation.  Dedicated  pre-PCR  workspaces,  equipment  and  protocols  should  be 
used  [144,160,161]. 

5.5.  Technologies  for  miRNA  Profiling 

A  variety  of  methods  have  been  developed  for  miRNA  profiling  [162,163],  but  due  to  their  short 
sequence  length,  end  region  sequence  variation  (e.g.,  isomiRs)  and  high  sequence  conservation  among 
family  members,  accurate  measurement  for  miRNA  is  nontrivial.  In  addition,  the  wide  range  of 
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individual  miRNA  concentrations  in  body  fluids  and  number  of  elinieal  speeimens  needed  to  generate 
statistieally  meaningful  results,  require  that  methods  seleeted  need  to  have  suffieient-sensitivity  and 
dynamie  range  with  reasonable  throughputs.  Commonly  used  miRNA  measurement  methods  inelude: 
hybridization-based  approaehes  (e.g.,  Agilent  mieroarrays.  Nanostring  nCounter)  [164]  reverse 
transeription  quantitative  PCR  arrays  (RT-qPCR,  e.g.,  TaqMan  TLDA  mierofluidie  eards,  Exiqon 
mieroRNA  Ready-to-Use  PCR  panels,  etc.)  and  next  generation  sequeneing  (NGS,  e.g.,  HiSeq  2000, 
SOLiD,  Ion  Torrent,  MiSeq)  [165].  Eaeh  teehnique  presents  relative  strengths  and  weaknesses,  and  the 
ehoiee  of  approaeh  is  dependent  upon  the  researeh  need. 

Hybridization-based  methods  are  well  established,  have  eonsiderable  throughput.  However,  array- 
based  approaehes  have  limited  sequenee  speeifieity,  sensitivity  (requiring  ng-pg  of  total  input  RNA), 
relatively  limited  linear  and  dynamie  ranges  (four  orders  of  magnitude)  and  are  diffleult  to  use  for 
absolute  quantifleation  and  eannot  identify  novel  miRNAs 

RT-qPCR  arrays  are  also  well  established  and  are  more  sensitive  than  hybridization-based 
approaehes  (requiring  <  ng-pg  of  total  input  RNA),  have  wider  dynamie  range  (six  orders  of 
magnitude),  and  ean  be  followed  by  absolute  quantifleation  on  the  same  instrument,  but  with  lower 
throughput.  However,  like  hybridization-based  approaehes,  prior  knowledge  regarding  the  miRNA  to 
be  measured  is  required,  the  speeifle  assay  design  is  non-trivial  and  this  approaeh  eannot  be  used  for 
novel  miRNA  diseovery. 

Next  generation  sequeneing-based  methods  ean  identify  novel  miRNA  sequenees,  distinguish 
isomiRs,  and  show  substantial  dynamie  range  (five  or  more  orders  of  magnitude)  although,  this 
approaeh  requires  signifieant  starting  material  (ng-pg)  and  eomputational  support.  The  analysis  of 
multiple  samples  during  a  single  run  is  possibly  through  “bareoding”  in  whieh  eaeh  RNA  sample  is 
tagged  with  a  6-nueleotide  long  identifying  sequenee  [166].  Relative  quantifleation  between  datasets 
ean  be  performed  using  a  digital  gene  expression  profiling  approaeh  [167].  A  major  limitation  of  NGS 
is  that  it  is  subjeet  to  eritieal  sequenoe-speeifie  bias,  amplifying  the  deteetion  of  some  miRNAs  and 
redueing  others.  Systematie  investigation  has  revealed  that  this  bias  is  primarily  derived  from  eDNA 
library  preparation  and  not  the  NGS  platform  used  [168-172].  Using  a  473-member  synthetie  miRNA 
test  set,  researehers  observed  variability  in  miRNA  abundanee  of  up  to  four  orders  of  magnitude  by 
digital  gene  expression,  depending  on  the  method  used  to  eonstruet  the  library  [168].  Variability  in 
library  preparation  has  been  attributed  to  sequenee  preferenees  by  ligation  enzymes  that  are  used  to 
attaeh  adapter  oligonueleotides  [168,170,172],  differenees  in  RNA  seeondary  strueture  [173]  and 
amplifieation  by  PCR  [174-176].  Remarkable  sequenee-speeifie  differenees  in  produet  yields  have 
been  observed  using  both  T4Rnll  and  T4Rnl2tr-oatalyzed  ligation.  In  the  ease  of  T4Rnl2tr-based 
adapter  ligations,  sequenee  bias  was  substantially  alleviated  by  the  use  of  a  pooled  mixture  of  5’-  and 
3'-  adapter  sequenees  that  varied  at  the  two  terminal  nueleotides  proximal  to  the  ligation  [172].  RNA 
seeondary  strueture  may  also  impaet  analysis  by  sequeneing,  as  RNAs  with  3 ’-stem  struetures  are 
underrepresented  in  sequeneing  reads  [173].  Adding  further  eomplexity  to  the  problem,  RNA  may  eo- 
fold  with  adapter  sequenees,  sequestering  ends  and  making  them  unavailable  for  sequeneing.  PCR 
indueed  bias  in  NGS  library  eonstruetion  has  been  observed  [174]  and  is  a  funetion  of  G/C  eontent  in 
the  sequenee.  Additional  denaturation  [177]  and  optimized  PCR  buffers  [174]  have  been  reported  to 
reduee  this  bias.  The  use  of  non-amplifieation  dependent,  third  generation  sequeneing  teehnologies 
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(tSMS,  Helicos,  Cambridge,  MA)  [178]  may  circumvent  NGS  limitations  attributable  to  PCR,  but 
these  technologies  are  not  widely  available. 

5.6.  Quantification  and  Validation 

Lead  miRNA  biomarkers  candidates  identified  through  initial  profiling  are  often  directly  quantified 
individually  using  RT-qPCR,  an  analog  technology  based  on  a  fluorescent  signal  output  relative  to 
external  standard  curves  [179].  This  approach  introduces  multiple  significant  sources  of  intra-  and 
inter-assay  variation  (due  to  differences  in  preparation,  unequal  amplification  efficiency  between 
standard  and  target,  Monte-Carlo  effect,  variations  in  thresholding,  etc.  [179-181]).  Digital  PCR  [182] 
is  an  absolute  method  of  nucleic-acid  quantification,  which  partitions  target  molecules  across  many 
replicate  reactions,  followed  by  end-point  PCR.  In  the  presence  of  amplification-dependent  fluorescent 
probes,  each  reaction  generates  a  highly  resolved,  digital  output  signal  corresponding  to  the  presence 
or  absence  of  starting  template.  The  starting  concentration  of  template  is  then  determined  by  Poisson 
statistical  analysis  of  the  number  of  positive  and  total  reactions.  Although  this  concept  was  introduced 
early  in  the  history  of  PCR  [183]  and  presents  theoretical  advantages  over  RT-qPCR  based  methods, 
practical  limitations  on  dynamic  range  and  throughput  subsequently  limited  the  use  of  dPCR. 
However,  recent  advancements  in  instrumentation  and  chemistry  [184,185]  circumvent  these  obstacles 
by  using  rapid  microfluidic  analysis  of  nano-  to  picoliter-sized  droplet  partitions  (BioRad  QX-200  and 
RainDance  RainDrop,  respectively),  enabling  the  practical  analysis  of  sizable  sample  sets.  Droplet 
digital  PCR  (ddPCR)  is  highly  precise,  because  as  an  absolute  method  of  quantification  it  is  not  reliant 
on  reference  standard  curves.  ddPCR  has  recently  been  shown  [156]  to  reduce  analytic  variability 
between  sample  preparations  37%-86%  and  to  improve  day-to-day  reproducibility  in  the  analysis  of 
miRNAs  from  prostate  cancer  patient  serum  by  a  factor  of  7  compared  to  RT-qPCR.  As  an  endpoint 
method,  ddPCR  is  also  resistant  to  residual  PCR  inhibitors  that  may  be  present  in  samples  [142,186], 
providing  further  robustness. 

A  basic  set  of  assay  performance  parameters  should  be  included  in  each  assay  such  as:  limit  of 
detection  (LOD)  [187],  linear  dynamic  range  [188],  PCR-efficiency  (where  applicable)  and  precision. 
The  determination  of  these  parameters  would  aid  in  study  reproducibility  by  eliminating  spurious 
results  based  on  data  generated  outside  the  operating  range  of  the  assay  and  provide  quality  assurance 
that  the  analysis  is  based  on  valid,  reproducible  results.  The  Minimum  Information  for  Publication  of 
Quantitative  Real-Time  PCR  Experiments  (MIQE  Guidelines)  provides  a  detailed  framework  for  the 
analysis  and  reporting  of  miRNA  quantification  results  (derived  from  RT-qPCR  or  other  methods)  [189]. 
In  addition,  a  concise  set  of  criteria  to  be  reported  in  cell-free  miRNA  studies  [190]  has  been  proposed 
for  use  with  MIQE: 

•  Sample  collection:  collection  site,  gauge  and  type  of  needle,  elapsed  time  between  collection  and 
processing  (including  clotting  time,  if  applicable),  processing  conditions,  storage  conditions 

•  Sample  quality  control:  Age  of  sample,  hemolysis  measurement  and  cut-off  criteria 

•  RNA  isolation:  sample  volume,  isolation  kit/reagent,  carrier  used,  spike-in  used, 
elution/resuspension  volume 
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•  RT-qPCR:  primer  sequences  or  assay  IDs,  template  amount,  RT  reagents  and  conditions, 
preamplification  (if  used),  cDNA  dilution,  qPCR  reagents  and  conditions,  instmmentation,  software 

•  Data  normalization:  Equations  for  normalization,  relative  quantification,  standard  curves, 
variability  of  endogenous  controls,  %  recovery  of  spike-in,  raw  data 

6,  Future  Prospects  and  Recommendations 

The  discovery  and  analysis  of  miRNA  biomarkers  from  biofiuids  entails  substantial  experimental 
detail  and  complexity.  Although  substantial  knowledge  has  been  acquired  regarding  circulating 
miRNAs  and  their  applications  for  various  pathologies,  their  clinical  diagnostic  potential  has  yet  to  be 
realized.  In  part  this  is  due  to  the  lack  of  fundamental  understanding  of  various  intrinsic  and  extrinsic 
factors  that  may  affect  the  spectrum  of  circulating  RNA,  but  assay  variability  and  the  lack  of  standard 
protocols  for  miRNA  measurement  also  play  a  role.  To  further  advance  the  field,  we  will  need  to  gain 
knowledge  on  the  state  of  miRNA  in  circulation,  develop  commonly  accepted  standard  operating 
procedures,  introduce  controls  to  assess  the  consistency  of  the  results  at  different  stages  and  develop 
proper  normalization  strategies.  In  order  to  improve  the  accuracy  and  reproducibility  in  the  field,  and 
with  the  aim  of  realizing  their  clinical  potential,  we  make  the  following  general  suggestions  for  future 
studies  of  miRNA  biomarkers  from  biofiuids: 

•  To  the  extent  that  is  practical,  match  patient  with  control  samples  as  closely  as  possible  with 
regard  to  all  relevant  information  (age,  sex,  smoking  status,  etc.)  except  the  disease  status  in 
question  (e.g.,  cancer  or  benign).  Minimize  possible  confounding  variables  by  explicitly 
specifying  detailed  inclusion  and  exclusion  criteria  for  studies  and  collecting  specimens  from 
donors  in  the  most  consistent  manner  possible  (time  of  day,  feeding  status,  etc.). 

•  Estimate  the  practical  variability  associated  with  collection  (time  before  processing,  etc.),  storage 
(freezing  conditions,  time  in  freezer,  etc.),  purification  (e.g.,  precision  of  extraction  methods)  and 
analysis  by  performing  basic  pilot  experiments  with  similar  specimens  and  report  the  results  as 
supplemental  materials. 

•  miRNAs  that  are  highly  expressed  in  abundant  blood  cells  (e.g.,  miR-451  in  RBCs,  [119,138]) 
are  likely  to  be  derived  from  these  cells  and  not  from  the  diseased  tissue,  in  addition  to  being 
sensitive  to  hemolysis,  etc.  They  are  therefore  unlikely  to  be  suitable  as  robust  biomarkers  and 
should  be  filtered  from  the  data  or  at  a  minimum  interpreted  with  caution. 

•  Validate  measurement  results  using  different  platforms  (e.g.,  NGS  followed  by  RT-qPCR). 

•  Report  the  essential  variables  outlined  in  the  MIQE  Guidelines  [189]  and  those  recommended  for 
cell-free  miRNA  studies  [190]. 

It  is  also  important  to  note  that  the  combination  of  miRNA  and  protein-based  biomarkers  may 
provide  more  robust  and  sensitive  measurement  as  they  are  susceptible  to  different  types  of  errors. 
Despite  the  difficulties  of  getting  consistent  miRNA  measurement  results  with  our  current 
technologies,  we  believe  implementing  and  following  a  better  sample  preparation  guideline,  and  new 
measurement  technologies  such  as  isothermal  amplification  will  facilitate  the  development  of  miRNA 
based  biomarkers  in  clinical  setting. 
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